Abstract. Starting from the problem of finding the approximate value of a function, this paper introduces a measure of the approximation-degree between two numerical values, proposes the concepts of “strict approximation” and “strict approximation region”, derives the corresponding one-dimensional interpolation methods and formulas, and then presents a calculation model called the “sum-times-difference formula” for high-dimensional interpolation, thereby developing a new interpolation approach: ADB interpolation. ADB interpolation is applied to the interpolation of actual functions with satisfactory results. In principle and in effect, the approach is novel and has the advantages of simple calculation, stable accuracy, suitability for parallel processing and for high-dimensional interpolation, and easy extension to the interpolation of vector-valued functions. Applying the approach to instance-based learning yields a new instance-based learning method: learning using ADB interpolation. The learning method uses a distinctive technique and also has the advantages of a definite mathematical basis, implicit distance weights, avoidance of misclassification, high efficiency, a wide range of applications, and interpretability. In principle, this method is a kind of learning by analogy. It and deep learning, which belongs to inductive learning, can complement each other, and for some problems the two can even achieve “different approaches but equal results” in a big data and cloud computing environment. Thus, learning using ADB interpolation can also be regarded as a kind of “wide learning” that is dual to deep learning.


Keywords: Approximation-Degree · Interpolation · Strict Approximation · Sum-Times-Difference Formula · Instance-Based Learning · Wide Learning


1 Introduction 


Instance-based learning [1,2] is also called the nonparametric approach [3,4]. Instead of establishing a global model of the sample data, the approach uses the sample data to interpolate directly and thereby approximate the objective function. Examples are experience, the specific manifestation of a general rule. From the cognitive perspective, learning from instances is closer to human learning, so it makes sense to give machines this ability. Instance-based learning has been studied for a long time and many results have been obtained (such as the k-nearest neighbor algorithm, the distance-weighted nearest neighbor algorithm, and the locally weighted regression algorithm), but some problems and shortcomings remain (such as high-dimensional interpolation and misclassification). Therefore, instance-based learning and the corresponding interpolation techniques still call for further research and development. On the other hand, the current environments of big data and cloud computing undoubtedly provide strong support for instance-based learning and open up a new place for it to display its prowess. Inspired by the approximate evaluation method of flexible linguistic functions [5,6] in reference [7], in this paper we introduce a measure of the degree of approximation between numerical values to study the approximate evaluation of numerical functions, and then explore new interpolation approaches based on the degree of approximation and the corresponding instance-based learning methods.


2 Approximation-Degree, Strict Approximation and Strict 
Approximation Region 


Definition 2.1. Let $R$ be the real number field, $x_0 \in [a,b] \subset R$, and let $[\alpha_0, \beta_0] \subset [a,b]$ be a neighborhood of $x_0$, which is called the approximation region of $x_0$. For $\forall x \in [a,b]$, we say that $x$ is approximate to $x_0$ if and only if $x \in [\alpha_0, \beta_0]$.

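Definition 2.2. Let $R^n$ be the n-dimensional real vector space, $U = [a_1,b_1] \times [a_2,b_2] \times \cdots \times [a_n,b_n] \subset R^n$, and $x_0 = (x_{1_0}, x_{2_0}, \ldots, x_{n_0}) \in U$. For $\forall x = (x_1, x_2, \ldots, x_n) \in U$, we say that $x$ is strictly approximate to $x_0$ if and only if the components $x_1, x_2, \ldots, x_n$ of $x$ are approximate to the components $x_{1_0}, x_{2_0}, \ldots, x_{n_0}$ of $x_0$, respectively, i.e., $x_i \in [\alpha_i, \beta_i] \subset [a_i, b_i]$ ($[\alpha_i, \beta_i]$ is the approximation region of $x_{i_0}$), $i = 1,2,\ldots,n$; and the “square” region $[\alpha_1, \beta_1] \times [\alpha_2, \beta_2] \times \cdots \times [\alpha_n, \beta_n] \subset U$ is called the strict approximation region of $x_0$.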

In contrast to the strict approximation region in Definition 2.2, we refer to the “circle” region centered on the point $x_0$ as the ordinary approximation region of $x_0 \in U$. The relation between the strict approximation region and the ordinary approximation region of the same (2D) point $x_0$ is shown in Fig. 2.1. The illustration also shows the relationship between strict approximation and ordinary approximation. In fact, reference [8] has stated that the geometric meaning of “close to point $(x_1, x_2, \ldots, x_n)$” is different from that of “close to $x_1$ and close to $x_2$, ..., and close to $x_n$”.


Fig. 2.1. An illustration of the relation between strict approximation region and ordinary 
approximation region 


Where the square region is the strict approximation region of the point $x_0$, and the circular region is its ordinary approximation region.


Definition 2.3. Let $R$ be the real number field, $x_0 \in [a,b] \subset R$, and let $[\alpha_0, \beta_0] \subset [a,b]$ be the approximation region of $x_0$. Set

$$A_{x_0}(x) = \begin{cases} \dfrac{x - \alpha_0}{x_0 - \alpha_0}, & x \in [\alpha_0, x_0] \\[2mm] \dfrac{x - \beta_0}{x_0 - \beta_0}, & x \in [x_0, \beta_0] \end{cases} \tag{2.1}$$

which is called the degree of approximation, or approximation-degree for short, of $x$ to $x_0$. We call the function relation defined by Equation (2.1) the approximation-degree function of $x_0$.
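As a minimal sketch of Definition 2.3, Equation (2.1) can be written in a few lines of Python. The function name `approximation_degree` and its argument names are illustrative assumptions, not notation from the paper; the early return for `x == x0` is added only to avoid a zero denominator when $x_0$ sits on the boundary of its approximation region.

```python
def approximation_degree(x, x0, alpha0, beta0):
    """Approximation-degree of x to x0, following Eq. (2.1).

    [alpha0, beta0] is the approximation region of x0, with
    alpha0 <= x0 <= beta0. The result is 1 at x0 and falls
    linearly to 0 at either end of the region.
    """
    if not alpha0 <= x <= beta0:
        raise ValueError("x is not approximate to x0 (outside the region)")
    if x == x0:
        return 1.0
    if x < x0:
        return (x - alpha0) / (x0 - alpha0)   # left branch of Eq. (2.1)
    return (x - beta0) / (x0 - beta0)         # right branch of Eq. (2.1)
```

Note that the region does not have to be symmetric about $x_0$: each side is scaled by its own width.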


3 Approximate Evaluation of Functions Based on Approximation-Degree


3.1 Finding Approximate Value of a Univariate Function Based on Approximation-Degree


Let $R$ be the real number field, $U = [a,b] \subset R$, $V = [c,d] \subset R$, and let $y = f(x)$ be a continuous function relation from $U$ to $V$. Given a pair $(x_0, y_0)$ of corresponding values of the function $y = f(x)$ and a point $x'$ approximate to $x_0$, find the approximate value of $f(x')$.

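Let the approximation region of $x_0$ be $[a_1,b_1] \subset [a,b]$. According to the definition of the approximation-degree function above, the approximation-degree function of $x_0$ is

$$A_{x_0}(x) = \begin{cases} \dfrac{x - a_1}{x_0 - a_1}, & x \in [a_1, x_0] \\[2mm] \dfrac{x - b_1}{x_0 - b_1}, & x \in [x_0, b_1] \end{cases} \tag{3.1}$$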


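And let the approximation region of $y_0$ be $[c_1, d_1] \subset [c,d]$; then the approximation-degree function of $y_0$ is

$$A_{y_0}(y) = \begin{cases} \dfrac{y - c_1}{y_0 - c_1}, & y \in [c_1, y_0] \\[2mm] \dfrac{y - d_1}{y_0 - d_1}, & y \in [y_0, d_1] \end{cases} \tag{3.2}$$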


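It can be seen that the range of the approximation-degree function $A_{x_0}(x)$ is $[0, 1]$ and that the function is invertible on each of its pieces. In fact, for $A_{y_0}(y)$ it is easy to obtain that

$$A_{y_0}(y)^{-1} = \begin{cases} d_y(y_0 - c_1) + c_1, & d_y \in [0, 1] \\ d_y(y_0 - d_1) + d_1, & d_y \in [0, 1] \end{cases} \tag{3.3}$$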


where $d_y$ is the approximation-degree of $y$ to $y_0$.

Now we find the approximation-degree $A_{x_0}(x')$ and then set $A_{y_0}(y') = A_{x_0}(x')$ (that is, we transmit the approximation-degree of $x'$ to $x_0$ to that of $y'$ to $y_0$). Further, we derive the required approximate value $y'$ from the approximation-degree $A_{y_0}(y')$ and the inverse function $A_{y_0}(y)^{-1}$ of the approximation-degree function $A_{y_0}(y)$.

It can be seen that the inverse function $A_{y_0}(y)^{-1}$ of $A_{y_0}(y)$ is a piecewise function with two parallel expressions. Thus, substituting the approximation-degree $A_{y_0}(y') = d$ into $A_{y_0}(y)^{-1}$, we get two values of $y$ ($y_1$ and $y_2$). Which of them should be chosen as the desired approximate value of the function?

Obviously, the desired $y$ is related to the position of $x'$ relative to $x_0$ and to the trend (i.e., increasing, decreasing, or constant) of $f(x)$ near $x_0$. Thus, we have the following ideas and techniques:

(1) Consider the derivative $f'(x_0)$ of the function $f(x)$ at the point $x_0$. If $f'(x_0)$ is known, we can estimate the trend of $f(x)$ near $x_0$ according to whether $f'(x_0)$ is positive, negative, or zero, and then determine the choice of $y$.

(2) Consider whether there is a point $x^*$ on the $x'$ side near the point $x_0$ (not beyond the approximation region of $x_0$) whose corresponding function value $f(x^*) = y^*$ is known. If there is such a point $x^*$, we can estimate the trend of $f(x)$ between $x_0$ and $x^*$ from the size relation between $y^*$ and $y_0$, and then determine the choice of $y$. For instance, when $x^* < x' < x_0$: if $y^* < y_0$, the general trend of $f(x)$ is increasing on the subinterval $(x^*, x_0)$, so $y_1$, the value less than $y_0$, should be chosen; if $y^* > y_0$, the general trend of $f(x)$ is decreasing on $(x^*, x_0)$, so $y_2$, the value larger than $y_0$, should be chosen.

(3) If the derivative $f'(x_0)$ is unknown and there is no such reference point $x^*$, take the average $(y_1 + y_2)/2$, or take $y_0$ directly, as the approximate value of $f(x')$.

Due to space limits, in the following we discuss only the second method further and use the third method for classification problems. The first method will be introduced in another article.
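The procedure of this subsection (compute the approximation-degree of $x'$, transmit it to $y$, invert, then pick one of the two candidates) can be sketched as follows. This is an illustrative sketch only: the function name `approx_value` and its parameters are assumptions, and the selection rule for a reference point lying to the right of $x_0$ is inferred by symmetry from the left-hand case described in technique (2). The caller is assumed to pass a reference point on the same side of $x_0$ as $x'$.

```python
def approx_value(x_new, x0, y0, x_region, y_region, reference=None):
    """Approximate f(x_new) from one known pair (x0, y0).

    x_region = (a1, b1) and y_region = (c1, d1) are the approximation
    regions of x0 and y0. reference, if given, is a known pair
    (x_star, y_star) on the same side of x0 as x_new (technique (2));
    without it the two candidates are averaged (technique (3)).
    """
    a1, b1 = x_region
    c1, d1 = y_region
    if x_new == x0:
        return y0
    # Approximation-degree of x_new to x0, Eq. (3.1).
    if x_new < x0:
        degree = (x_new - a1) / (x0 - a1)
    else:
        degree = (x_new - b1) / (x0 - b1)
    # Transmit the degree to y and invert, Eq. (3.3): two candidates.
    y1 = degree * (y0 - c1) + c1   # candidate not larger than y0
    y2 = degree * (y0 - d1) + d1   # candidate not smaller than y0
    if reference is None:
        return (y1 + y2) / 2
    _, y_star = reference
    if y_star < y0:
        return y1   # values fall away from y0 toward the reference
    if y_star > y0:
        return y2   # values rise away from y0 toward the reference
    return y0       # no change between x0 and the reference


# Known pair (1, 2), regions [0, 2] and [0, 4], query x' = 0.5.
print(approx_value(0.5, 1, 2, (0, 2), (0, 4), reference=(0, 0)))
print(approx_value(0.5, 1, 2, (0, 2), (0, 4)))
```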


3.2 Finding Approximate Value of a Multivariate Function Based on Approximation-Degree


Let us take a function of two variables as an example to discuss this problem.

Let $z = f(x,y)$ be a function (relation) from $[a_1,b_1] \times [a_2,b_2]$ to $[c,d]$. Suppose a pair of corresponding values $((x_0, y_0), z_0)$ of the function $z = f(x,y)$ and a point $(x',y')$ approximate to the point $(x_0, y_0)$ are known. In the case that the expression of $f(x,y)$ is unknown or not used, find the approximate value of $f(x', y')$.

By the definition of strict approximation, $(x', y')$ being approximate to $(x_0, y_0)$ is equivalent to $x'$ being approximate to $x_0$ and $y'$ being approximate to $y_0$. Thus, we can find the approximate values $z_x$ and $z_y$ of the functions $f(x, y_0)$ and $f(x_0, y)$ at the points $x'$ and $y'$, respectively. These are really two approximate evaluation problems for univariate functions. We further suppose that there are adjacent points $(x^*, y_0)$ and $(x_0, y^*)$ in the $x$-direction and $y$-direction of the point $(x_0, y_0)$, as shown in Fig. 3.1, whose function values $f(x^*, y_0)$ and $f(x_0, y^*)$ are known. Then the approximate value $z_x$ of $f(x', y_0)$ can be obtained by utilizing $f(x^*, y_0)$, and the approximate value $z_y$ of $f(x_0, y')$ by utilizing $f(x_0, y^*)$, just as for the univariate function above. Thus, we first compute the approximation-degrees $A_{x_0}(x')$ and $A_{y_0}(y')$ separately, then set $A_{z_0}(z) = A_{x_0}(x')$ and $A_{z_0}(z) = A_{y_0}(y')$, substitute them separately into the inverse function $A_{z_0}(z)^{-1}$ of $A_{z_0}(z)$ to get two pairs of candidate approximations, and then, taking $f(x^*, y_0)$ and $f(x_0, y^*)$ as references, choose $z_x$ and $z_y$ from the respective candidate values (as shown in Fig. 3.1).

Having obtained the approximate values $z_x$ and $z_y$, how can we further obtain the approximate value $z$ we need?

Let $z_1 = (z_x + z_y)/2$ be the average of $z_x$ and $z_y$. It can be seen from Fig. 3.2 that $z_1$ can be viewed as an approximate value of $f(x,y)$ at the midpoint (denoted by $(x_1, y_1)$) between $(x', y_0)$ and $(x_0, y')$. In the figure, $z_1 < z_0$, i.e., the trend of the function values from $z_0$ to $z_1$ is decreasing. Set $z_0 - z_1 = c_1$ ($c_1$ is the length of segment BC in Fig. 3.3(a)); then $z_1 = z_0 - c_1$. According to the trend of the function values from $z_0$ to $z_1$ (i.e., the slope of segment AB in Fig. 3.3(a)), and taking into account that the point $(x_1, y_1)$ is just the midpoint of the segment joining $(x',y')$ and $(x_0, y_0)$, that is, $(x',y') - (x_0, y_0) = 2[(x_1, y_1) - (x_0, y_0)]$, we infer that the approximate value of the function at the point $(x', y')$ can be $z_0 - 2c_1$ (as shown in Fig. 3.3(a)). Thus, it follows that

$$z = z_0 - 2c_1 = z_0 - 2(z_0 - z_1) = z_0 - 2[z_0 - (z_x + z_y)/2] = z_x + z_y - z_0$$

Fig. 3.1. Utilizing the values of the function at the points $(x^*, y_0)$ and $(x_0, y^*)$ to determine the approximate values of $f(x', y_0)$ and $f(x_0, y')$, respectively

Fig. 3.2. Illustration-1 of synthesizing $z_x$ and $z_y$ into $z$


Of course, $z_1$ may also be greater than or equal to $z_0$. If $z_1 > z_0$, the trend of the function values from $z_0$ to $z_1$ is increasing (as shown in Fig. 3.3(b)). Set $z_1 - z_0 = c_2$ ($c_2$ is the length of segment BC in Fig. 3.3(b)); then $z_1 = z_0 + c_2$. Then, according to the trend of the function values from $z_0$ to $z_1$ (i.e., the slope of segment AB in Fig. 3.3(b)), we infer that the value of the function at the point $(x',y')$ can be $z_0 + 2c_2$. Thus,

$$z = z_0 + 2c_2 = z_0 + 2(z_1 - z_0) = z_0 + 2[(z_x + z_y)/2 - z_0] = z_x + z_y - z_0$$


The third case: $z_1 = z_0$. This indicates that the function values remain unchanged from $z_0$ to $z_1$. Thus, we can take $z = z_0$. And by $z_0 = z_1 = (z_x + z_y)/2$, it follows that $2z_0 = z_x + z_y$. Thus,

$$z = 2z_0 - z_0 = z_x + z_y - z_0$$


In summary, no matter what the relationship between the average of $z_x$ and $z_y$ and $z_0$ is, that is, no matter how the value of the function varies from $z_0$ to $z_1$, the approximate value of the function at the point $(x', y')$ can always be taken as

$$z = z_x + z_y - z_0 \tag{3.4}$$

This equation is also the calculation model of the approximate value of a function of two variables.

Fig. 3.3. Illustration-2 of synthesizing $z_x$ and $z_y$ into $z$

It can be seen that this actually first splits the approximate evaluation of a function of two variables into the approximate evaluations of two functions of one variable, and then synthesizes the two obtained approximate values into one value as the approximate value of the original function of two variables. Extending this technique of “first splitting, then synthesizing” to the approximate evaluation of a function of 3 variables, $u = f(x,y,z)$, we obtain the synthesis formula

$$u = u_x + u_y + u_z - 2u_0 \tag{3.5}$$

And then, for a function of n variables, $y = f(x_1, x_2, \ldots, x_n)$, the corresponding synthesis formula for the approximate value of the function is

$$y = y_{x_1} + y_{x_2} + \cdots + y_{x_n} - (n-1)y_0$$

or

$$y = \sum_{i=1}^{n} y_{x_i} - (n-1)y_0 \tag{3.6}$$

For convenience of narration, we refer to Equations (3.4), (3.5) and (3.6) as the sum-times-difference formula.
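To make Equation (3.6) concrete, here is a short Python sketch. The helper name `sum_times_difference` is an assumption for illustration. The check uses exact one-variable values in place of interpolated ones and a test function chosen here, $f(x,y,z) = x^2 + 3y - z$, which is additively separable; for such functions the formula reproduces the true value exactly, because the base value $y_0$ is counted $n$ times in the sum and subtracted $n-1$ times.

```python
def sum_times_difference(partials, y0):
    """Synthesize n one-variable approximations into one value, Eq. (3.6).

    partials[i] approximates f with only the i-th coordinate moved from
    the base point to the query point; y0 is f at the base point.
    """
    n = len(partials)
    return sum(partials) - (n - 1) * y0


def f(x, y, z):
    # Hypothetical test function, additively separable in x, y and z.
    return x * x + 3 * y - z


base = (1.0, 2.0, 3.0)
query = (2.0, 2.5, 1.0)
partials = [
    f(query[0], base[1], base[2]),   # only x moved
    f(base[0], query[1], base[2]),   # only y moved
    f(base[0], base[1], query[2]),   # only z moved
]
print(sum_times_difference(partials, f(*base)), f(*query))
```

For functions with interaction terms (for example a product $xy$) the formula is only an approximation, which is why the query point is required to lie in the strict approximation region of the base point.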


4 Interpolation Based on Approximation-Degree 


Let $y = f(x)$ be a function (relation) from $[a,b]$ to $[c,d]$. A set of pairs of corresponding values of the function $y = f(x)$, $(x_1,y_1), (x_2,y_2), \ldots, (x_n, y_n)$, is known, where $x_1 < x_2 < \cdots < x_n$. The question is: in the case that the expression of $f(x)$ is unknown or not used, construct an interpolating function $g(x)$ such that $g(x_i) = f(x_i)$ $(i = 1,2,\ldots,n)$ and, for other $x \in [a,b]$, $g(x) \approx f(x)$. This is the usual interpolation problem. We now use the above approach to finding the approximate value of a function to solve it.

Let $a = x_1$, $x_n = b$; then $x_1, x_2, \ldots, x_n$ is a group of interpolation base points (or nodes). We define the approximation region of $x_1$ as $[x_1, x_2]$, the approximation region of $x_i$ as $[x_{i-1}, x_{i+1}]$ $(i = 2,3,\ldots,n-1)$, and the approximation region of $x_n$ as $[x_{n-1}, x_n]$, and then define the approximation-degree functions of the base points $x_1$, $x_i$, and $x_n$ separately as

$$A_{x_1}(x) = \frac{x - x_2}{x_1 - x_2}, \quad x \in [x_1, x_2] \tag{4.1}$$

$$A_{x_i}(x) = \begin{cases} \dfrac{x - x_{i-1}}{x_i - x_{i-1}}, & x \in [x_{i-1}, x_i] \\[2mm] \dfrac{x - x_{i+1}}{x_i - x_{i+1}}, & x \in [x_i, x_{i+1}] \end{cases} \tag{4.2}$$

$$A_{x_n}(x) = \frac{x - x_{n-1}}{x_n - x_{n-1}}, \quad x \in [x_{n-1}, x_n] \tag{4.3}$$
Tn — En-1 


Note that it is not hard to see from the above expressions of the approximation-degree functions that when $x \in [x_1, (x_1 + x_2)/2]$, $[(x_{i-1} + x_i)/2, (x_i + x_{i+1})/2]$, or $[(x_{n-1} + x_n)/2, x_n]$, the corresponding approximation-degrees $A_{x_1}(x)$, $A_{x_i}(x)$, and $A_{x_n}(x)$ are always $\ge 0.5$. This means that $x$ is closer to the corresponding base point $x_1$, $x_i$, or $x_n$.

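We then define the approximation-degree functions of $y_i$ $(i = 1,2,\ldots,n)$ by the same principle and in the same way:

$$A_{y_i}(y) = \frac{y - y_{i-1}}{y_i - y_{i-1}}, \quad y \in [y_{i-1}, y_i] \tag{4.4}$$

$$A_{y_i}(y) = \frac{y - y_{i-1}}{y_i - y_{i-1}}, \quad y \in [y_i, y_{i-1}] \tag{4.5}$$

$$A_{y_i}(y) = \frac{y - y_{i+1}}{y_i - y_{i+1}}, \quad y \in [y_i, y_{i+1}] \tag{4.6}$$

$$A_{y_i}(y) = \frac{y - y_{i+1}}{y_i - y_{i+1}}, \quad y \in [y_{i+1}, y_i] \tag{4.7}$$

Obviously, among the four expressions above, (4.4) = (4.5) and (4.6) = (4.7). Thus, the four functional expressions can be reduced to two:

$$A_{y_i}(y) = \frac{1}{y_i - y_{i-1}}\, y - \frac{y_{i-1}}{y_i - y_{i-1}}$$

$$A_{y_i}(y) = \frac{1}{y_i - y_{i+1}}\, y - \frac{y_{i+1}}{y_i - y_{i+1}}$$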


And then, we get the inverse expressions of these two functional expressions:

$$A_{y_i}(y)^{-1} = d_y(y_i - y_{i-1}) + y_{i-1} \tag{4.8}$$

$$A_{y_i}(y)^{-1} = d_y(y_i - y_{i+1}) + y_{i+1} \tag{4.9}$$

Here $d_y = A_{y_i}(y) \in [0, 1]$ is the approximation-degree of $y$ to $y_i$.


Now, set $A_{y_i}(y) = A_{x_i}(x)$, that is, transmit the approximation-degree of $x$ to $x_i$ to that of $y$ to $y_i$.

Also, consider that on the subinterval $[(x_{i-1} + x_i)/2, x_i]$ the interpolated function $y = f(x)$ may be increasing, decreasing, or constant. When $y = f(x)$ is increasing, certainly $y_{i-1} < y_i$, so the corresponding $y \in [(y_{i-1} + y_i)/2, y_i]$; when $y = f(x)$ is decreasing, certainly $y_i < y_{i-1}$, so the corresponding $y \in [y_i, (y_{i-1} + y_i)/2]$; and when $y = f(x)$ is constant, $y_i = y_{i-1}$, so the corresponding $y = y_i \in [(y_{i-1} + y_i)/2, y_i]$ as well as $y \in [y_i, (y_{i-1} + y_i)/2]$. Thus, when $x \in [(x_{i-1} + x_i)/2, x_i]$, the corresponding $y_i$ and $y_{i-1}$ are adjacent, and here $A_{x_i}(x) = \frac{x - x_{i-1}}{x_i - x_{i-1}}$. Thus Expression (4.8), the expression of the corresponding inverse function $A_{y_i}(y)^{-1}$, becomes

$$A_{y_i}(y)^{-1} = \frac{x - x_{i-1}}{x_i - x_{i-1}}(y_i - y_{i-1}) + y_{i-1} = \frac{y_i - y_{i-1}}{x_i - x_{i-1}}\, x + \frac{x_i y_{i-1} - x_{i-1} y_i}{x_i - x_{i-1}}$$

namely

$$y = \frac{y_i - y_{i-1}}{x_i - x_{i-1}}\, x + \frac{x_i y_{i-1} - x_{i-1} y_i}{x_i - x_{i-1}}, \quad x \in \left[\frac{x_{i-1} + x_i}{2}, x_i\right] \tag{4.10}$$


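Similarly, Expression (4.9) of the inverse function $A_{y_i}(y)^{-1}$ becomes

$$y = \frac{y_i - y_{i+1}}{x_i - x_{i+1}}\, x + \frac{x_i y_{i+1} - x_{i+1} y_i}{x_i - x_{i+1}}, \quad x \in \left[x_i, \frac{x_i + x_{i+1}}{2}\right] \tag{4.11}$$

Thus, with the two expressions, we can obtain directly the corresponding $y \in [(y_{i-1} + y_i)/2, y_i]$ or $[y_i, (y_{i-1} + y_i)/2]$ from $x \in [(x_{i-1} + x_i)/2, x_i]$, and obtain directly the corresponding $y \in [y_i, (y_i + y_{i+1})/2]$ or $[(y_i + y_{i+1})/2, y_i]$ from $x \in [x_i, (x_i + x_{i+1})/2]$.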

Actually, Equations (4.10) and (4.11) are two interpolation formulas. In this way, we have derived an interpolation approach by using the approximate evaluation of functions based on approximation-degree. We call this approach approximation-degree-based interpolation, or ADB interpolation for short.

Specifically, the practice of ADB interpolation is as follows: take the base points $a = x_1, x_2, \ldots, x_n = b$ as points of view; according to the base points and their approximation regions, partition the interval $[a,b] = [x_1, x_n]$ into $2n - 2$ subintervals, as shown in Fig. 4.1, $[x_1, (x_1 + x_2)/2]$, $[(x_1 + x_2)/2, x_2]$, $[x_2, (x_2 + x_3)/2]$, ..., $[(x_{n-1} + x_n)/2, x_n]$, as interpolation intervals; then, for an evaluated point $x \in [(x_{i-1} + x_i)/2, x_i]$, interpolate with Formula (4.10), and for an evaluated point $x \in [x_i, (x_i + x_{i+1})/2]$, interpolate with Formula (4.11). Since each specific interpolation formula implies the trend of the interpolated function $y = f(x)$ on the corresponding subinterval, there is not much error between the obtained approximate value and the expected value, and the case that two y-values are obtained from one $x$ does not occur.
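The practice described above can be sketched in Python as follows. This is an illustrative sketch, not the authors' implementation: the function name `adb_interp_1d` is an assumption, the nearest base point is found by a plain linear scan, and a query exactly halfway between two base points is assigned to the left one. The code works with the transmitted approximation-degree, i.e., Equations (4.8) and (4.9), which is algebraically the same as Formulas (4.10) and (4.11).

```python
import math


def adb_interp_1d(xs, ys, x):
    """One-dimensional ADB interpolation at x.

    xs: base points in increasing order; ys: function values at xs.
    """
    if not xs[0] <= x <= xs[-1]:
        raise ValueError("x lies outside [x_1, x_n]")
    # Base point x_i with approximation-degree >= 0.5, i.e. the nearest one.
    i = min(range(len(xs)), key=lambda t: abs(xs[t] - x))
    if x == xs[i]:
        return ys[i]
    # Adjacent base point on the side of x_i where x lies.
    k = i - 1 if x < xs[i] else i + 1
    # Approximation-degree of x to x_i, Eq. (4.2).
    degree = (x - xs[k]) / (xs[i] - xs[k])
    # Transmit the degree to y and invert, Eqs. (4.8)/(4.9).
    return degree * (ys[i] - ys[k]) + ys[k]


# Usage in the style of Example 4.1: base points every 0.1*pi on [0, 2*pi].
nodes = [0.1 * j * math.pi for j in range(21)]
values = [math.sin(t) for t in nodes]
print(adb_interp_1d(nodes, values, 0.26 * math.pi))
```

Within one subinterval, both formulas describe the straight line through the two adjacent sample points, so the values computed from the left base point and from the right base point join continuously at the midpoint.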


Fig. 4.1. Illustration of interpolation intervals partitioned by base points and their approximation 
regions in one-dimensional interpolation 


Example 4.1. Use ADB interpolation to interpolate the function $y = \sin x$.
We take sampled data points $(x_i, y_i)$ of $y = \sin x$ as follows:

$x_i$: $0 \times \pi, 0.1 \times \pi, 0.2 \times \pi, \ldots, 1.9 \times \pi, 2 \times \pi$.

$y_i$: $\sin x_i$.
and take the interpolation points

$x$: $0 \times \pi, 0.02 \times \pi, 0.04 \times \pi, 0.06 \times \pi, \ldots, 1.98 \times \pi, 2 \times \pi$.
Using ADB interpolation, the corresponding y-values obtained are as follows:
0 0.0624 0.1249 0.1873 0.2497 0.3118 0.3681 0.4245 0.4808 0.5371 0.5923 0.6369 0.6816 0.7263 0.7710 
0.8133 0.8420 0.8707 0.8994 0.9281 0.9530 0.9629 0.9728 0.9827 0.9926 0.9975 0.9876 0.9778 0.9679 
0.9580 0.9424 0.9138 0.8851 0.8564 0.8277 0.7934 0.7487 0.7040 0.6593 0.6146 0.5653 0.5089 0.4526 
0.3963 0.3400 0.2809 0.2185 0.1561 0.0936 0.0312 -0.0312 -0.0936 -0.1561 -0.2185 -0.2809 -0.3400 -0.3963 
-0.4526 -0.5089 -0.5653 -0.6146 -0.6593 -0.7040 -0.7487 -0.7934 -0.8277 -0.8564 -0.8851 -0.9138 -0.9424 
-0.9580 -0.9679 -0.9778 -0.9876 -0.9975 -0.9926 -0.9827 -0.9728 -0.9629 -0.9530 -0.9281 -0.8994 -0.8707 
-0.8420 -0.8133 -0.7710 -0.7263 -0.6816 -0.6369 -0.5923 -0.5371 -0.4808 -0.4245 -0.3681 -0.3118 -0.2497 
-0.1873 -0.1249 -0.0624 0.0000 
And the effect is shown in Fig. 4.2. 


Fig. 4.2. The effect drawing of ADB interpolation for function y = sin x 


From the interpolation method for univariate functions and the method of finding the approximate value of a function of n variables above, we obtain a general n-dimensional ADB interpolation method: split the n-dimensional interpolation into n one-dimensional interpolations, use one-dimensional ADB interpolation to find the corresponding approximate values, respectively, and finally synthesize the n approximate values into one value by using the sum-times-difference formula, as the approximate value of the function of n variables. n-dimensional ADB interpolation requires n pairs of one-dimensional interpolation formulas, which are as follows:


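$$y = \frac{y_{ij\ldots s} - y_{i-1,j\ldots s}}{x_{1_i} - x_{1_{i-1}}}\, x_1 + \frac{x_{1_i}\, y_{i-1,j\ldots s} - x_{1_{i-1}}\, y_{ij\ldots s}}{x_{1_i} - x_{1_{i-1}}}$$

$$y = \frac{y_{ij\ldots s} - y_{i+1,j\ldots s}}{x_{1_i} - x_{1_{i+1}}}\, x_1 + \frac{x_{1_i}\, y_{i+1,j\ldots s} - x_{1_{i+1}}\, y_{ij\ldots s}}{x_{1_i} - x_{1_{i+1}}}$$

$$y = \frac{y_{ij\ldots s} - y_{i,j-1\ldots s}}{x_{2_j} - x_{2_{j-1}}}\, x_2 + \frac{x_{2_j}\, y_{i,j-1\ldots s} - x_{2_{j-1}}\, y_{ij\ldots s}}{x_{2_j} - x_{2_{j-1}}}$$

$$y = \frac{y_{ij\ldots s} - y_{i,j+1\ldots s}}{x_{2_j} - x_{2_{j+1}}}\, x_2 + \frac{x_{2_j}\, y_{i,j+1\ldots s} - x_{2_{j+1}}\, y_{ij\ldots s}}{x_{2_j} - x_{2_{j+1}}}$$

$$\vdots$$

$$y = \frac{y_{ij\ldots s} - y_{ij\ldots s-1}}{x_{n_s} - x_{n_{s-1}}}\, x_n + \frac{x_{n_s}\, y_{ij\ldots s-1} - x_{n_{s-1}}\, y_{ij\ldots s}}{x_{n_s} - x_{n_{s-1}}}$$

$$y = \frac{y_{ij\ldots s} - y_{ij\ldots s+1}}{x_{n_s} - x_{n_{s+1}}}\, x_n + \frac{x_{n_s}\, y_{ij\ldots s+1} - x_{n_{s+1}}\, y_{ij\ldots s}}{x_{n_s} - x_{n_{s+1}}}$$

where $y_{ij\ldots s}$ is the value of the function at the base point $(x_{1_i}, x_{2_j}, \ldots, x_{n_s})$, i.e., $y_{ij\ldots s} = f(x_{1_i}, x_{2_j}, \ldots, x_{n_s})$.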


And the final synthesis formula (i.e., the sum-times-difference formula) for the approximate value of the function is

$$y = \sum_{i=1}^{n} y_{x_i} - (n-1)\, y_{ij\ldots s} \tag{4.12}$$

Here $y_{x_i}$ is the approximate value of the function obtained by one-dimensional ADB interpolation for $x_i$ $(i = 1,2,\ldots,n)$.

Actually, it is not difficult to see that Equation (4.12) can also be regarded as the formula of multidimensional ADB interpolation, which is also a calculation model of high-dimensional interpolation.

Example 4.2. Using ADB interpolation to do interpolation for function z = ig? — zy", the 
effect is shown in Fig. 4.3. 




Fig. 4.3. The effect drawing of ADB interpolation for function z = ig? = iy? 


Where the left is the functional graph before interpolating, and the right is the functional graph after 
interpolating. 


Example 4.3. Using ADB interpolation to do interpolation for function of three variables, 
yne ; i i 
u= ze , the effect (slice chart) is shown in Fig. 4.4. 


5 Instance-Based Learning Using ADB Interpolation


In the above, we develop a new interpolation approach, ADB interpolation, from finding ap- 
proximate value of a function. Since ADB interpolation is a local interpolation, it can be used 
for instance-based machine learning. In the following, we present two learning algorithms. 
Fig. 4.4. The effect drawing of ADB interpolation for function ze~


(1) A learning algorithm for regression problems

• Put the samples in the sample set $X = \{(x^i, f(x^i))\}_{i=1}^{m}$ ($x = (x_1, x_2, \ldots, x_n)$) into a corresponding data structure $S$ in a centralized or distributed manner (as training examples);
• For the currently queried $x' = (x'_1, x'_2, \ldots, x'_n)$:
• According to its coordinate components $x'_1, x'_2, \ldots, x'_n$, look up in $S$ sequentially or in parallel to determine an $x^k$ ($k \in \{1,2,\ldots,m\}$) to which the approximation-degree of $x'$ is highest;
• Take $x^k$ as the center and, according to the position of $x'$ relative to $x^k$, choose the $n$ corresponding nearest neighbors $x^1, x^2, \ldots, x^n$ (the data points here are renumbered) from $S$, then construct the corresponding one-dimensional interpolation formulas

$$y_{x_j} = g(x'_j \mid x^k_j, f(x^k), x^l_j, f(x^l)) \quad (j = 1,2,\ldots,n;\ l \in \{1,2,\ldots,n\})$$

and then compute $y_{x_1}, y_{x_2}, \ldots, y_{x_n}$ sequentially or in parallel;

• Return
$$f(x') = \sum_{j=1}^{n} y_{x_j} - (n-1)\, f(x^k)$$
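The steps of this algorithm can be sketched as below. The sketch rests on assumptions not stated in the paper: the samples are taken to lie on a rectangular grid (as in Example 5.1), so that each neighbor differs from $x^k$ in exactly one coordinate; the store $S$ is a plain dictionary; ties in the lookup go to the smaller coordinate; and the optional parallel step uses a thread pool purely to show that the $n$ one-dimensional interpolations are independent. The names `build_store`, `nearest` and `adb_regress` are hypothetical.

```python
from bisect import bisect_left
from concurrent.futures import ThreadPoolExecutor


def build_store(samples):
    """Put (point, value) samples into a dict S and list the grid axes."""
    S = dict(samples)
    n = len(next(iter(S)))
    axes = [sorted({p[j] for p in S}) for j in range(n)]
    return S, axes


def nearest(axis, q):
    """Grid coordinate on this axis to which q has the highest degree."""
    pos = bisect_left(axis, q)
    candidates = axis[max(pos - 1, 0):pos + 1]
    return min(candidates, key=lambda c: abs(c - q))


def adb_regress(S, axes, query, parallel=False):
    """Approximate f(query) by ADB interpolation over the store S."""
    center = tuple(nearest(ax, q) for ax, q in zip(axes, query))   # x^k
    f_k = S[center]

    def one_dim(j):
        q, c, ax = query[j], center[j], axes[j]
        if q == c:
            return f_k
        idx = ax.index(c) + (1 if q > c else -1)
        if not 0 <= idx < len(ax):
            raise ValueError("query lies outside the sampled range")
        neighbor = center[:j] + (ax[idx],) + center[j + 1:]
        degree = (q - ax[idx]) / (c - ax[idx])       # degree of q to x^k_j
        return degree * (f_k - S[neighbor]) + S[neighbor]

    dims = range(len(query))
    if parallel:
        with ThreadPoolExecutor() as pool:
            partials = list(pool.map(one_dim, dims))
    else:
        partials = [one_dim(j) for j in dims]
    # Sum-times-difference formula.
    return sum(partials) - (len(query) - 1) * f_k


# Example data of the same form as Example 5.1: z = -x^2 - y^2 on a grid.
coords = range(-20, 21, 2)
S, axes = build_store(((x, y), -x * x - y * y) for x in coords for y in coords)
print(adb_regress(S, axes, (-19.5, -19.5)))
```

With this data the query (-19.5, -19.5) gives -762, the value listed for that query point in Example 5.1.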
Example 5.1. Take the following pairs of corresponding values of the function $z = -x^2 - y^2$, $((x_i, y_j), z_{ij})$, as example data:
$x_i$: -20, -18, -16, ... , -2, 0, 2, ... , 16, 18, 20.
$y_j$: -20, -18, -16, ... , -2, 0, 2, ... , 16, 18, 20.
$z_{ij}$: $-x_i^2 - y_j^2$.


and take the following data points, (x,y), as query data: 

x: -20, -20, 20, -19.5, -17.8, -18, -15.3, -12, -10.2, -10, -10, 0, 0, 10, 10, 5.6, 4.7, -3.4, -1.8, -2.3, -3.6, 
1.2, -5.4, -15.6, -20, -20, -20, -18.3, 18.4, 17.5, 16.2, 14.5, 11.1, -5.4, -12.1, -8.5, -13.9, -7.5, -7.8, -9.8, 
-12.4, -13.5, -14.6, -17.5, -17.8. 

y: -20, 20, -20, -19.5, -17.8, -5, -15.5, 2.5, -10.2, 10, -20, 0, -20, -20, -10, -15.3, -3.8, -13.4, -2.8, -1.9, 
-5.6, -10.2, -6.5, 5.6, 0, -10, 10, 10.4, -18.1, -16.3, -14.4, -12.3, -6.3, -15.8, -8.2, -15.6, 0.9, 1.6, 3.2, 4.6, 
6.6, 2.8, -0.9, 18.6, 13.2. 

Using the above learning algorithm, the corresponding z-values obtained are as follows:
-800.0000 -800.0000 -800.0000 -762.0000 -634.4000 -350.0000 -476.0000 -151.0000 -208.8000 -200.0000
-500.0000 0 -400.0000 -500.0000 -200.0000 -267.0000 -37.8000 -192.8000 -12.4000 -9.6000 -45.6000
-106.8000 -73.0000 -276.0000 -400.0000 -500.0000 -500.0000 -444.2000 -667.0000 -573.2000 -470.8000
-362.8000 -164.4000 -280.0000 -214.2000 -317.0000 -195.2000 -60.2000 -72.4000 -118.4000 -198.8000
-191.8000 -215.8000 -653.8000 -492.4000


The effect drawing is shown in Fig. 5.1. 


Fig. 5.1. An effect drawing of the learning using ADB interpolation 


Where the grid curve is the graph formed by example data from the function $z = -x^2 - y^2$, and the red circles indicate the points obtained by learning using ADB interpolation.


Example 5.2. Fig. 5.2 below shows an effect drawing of learning using ADB interpolation. 
The example data are taken from the peaks function in MATLAB, and the query data is also 
designed according to this function. Limited by space, these data are omitted. 

Fig. 5.2. An illustration of the effect of learning using ADB interpolation

Where the left is the functional graph formed by example data and the right is the functional graph obtained by learning using ADB interpolation

As can be seen from the above, instance-based learning using ADB interpolation has the following characteristics:

• The learning method takes the data points ($x^i$) of the training examples $(x^i, f(x^i))$ as centers to set approximation regions and compute approximation-degrees.

• ADB interpolation is a local interpolation. The training examples involved in an interpolation are determined by the position of the currently queried data point $x'$ relative to its nearest data point $x^k$, and their number is related to the dimension of the vector $x$: an n-dimensional $x$ involves only $1+n$ training examples ($(x^k, f(x^k))$ and $(x^1, f(x^1)), (x^2, f(x^2)), \ldots, (x^n, f(x^n))$). But since the point $x'$ is approximate only to the point $x^k$, the corresponding $y_{x_i}$ $(i = 1,2,\ldots,n)$ are most affected by the example $(x^k, f(x^k))$, and the final synthesized value $f(x')$ is also most affected by $(x^k, f(x^k))$.

• If distributed storage and parallel processing (including parallel lookup and parallel computation) are used, the time complexity of the corresponding algorithm is independent of the dimension of the vector $x$, and its efficiency is almost equal to that of one-dimensional interpolation at all times.

• The interpolation formulas (including the sum-times-difference formula) are derived entirely by mathematical methods, so they have a definite mathematical basis.

• The sum-times-difference formula is actually a linear combination of the coordinate components of an interpolation point, and the denominators of the coefficients of the coordinate components are the differences between a coordinate component and the corresponding coordinate component of the corresponding base point, that is, the distance between the two, so these coefficients also happen to act as weights. Thus, in form, the sum-times-difference formula is a linear weighted regression formula. That is to say, the sum-times-difference formula here coincides with the traditional locally weighted linear regression model. However, in locally weighted linear regression these coefficients are determined by searching, while in ADB interpolation they are determined by looking up. The former is guided and constrained by an error function (e.g., $E = \frac{1}{2}\sum_{x \in X_k \subset X} (f(x) - \hat{f}(x))^2$), and the latter by an approximation-degree function (e.g., $A_{x_i}(x)$). In the sense of approximation-degree, the approximate value $f(x')$ of the function obtained from the sum-times-difference formula is always the most accurate.

• The accuracy of the returned approximate value $f(x')$ of a function is positively related to the approximation-degree of $x'$ to $x^k$.

• Comparing learning using ADB interpolation with deep learning: deep learning approximates the objective function with the strategy of deepening vertically [9], whereas learning using ADB interpolation can approximate the objective function with the strategy of increasing density horizontally. So learning using ADB interpolation and deep learning can complement each other, and can even achieve “different approaches but equal results” when samples are sufficient.


(2) A learning algorithm for classification problems

• Put the samples in the sample set $X = \{(x^i, f(x^i))\}_{i=1}^{m}$ ($x = (x_1, x_2, \ldots, x_n)$, $f(x)$ is a class label) into a corresponding data structure $S$ in a centralized or distributed manner;
• For the currently queried $x' = (x'_1, x'_2, \ldots, x'_n)$:
• According to its coordinate components $x'_1, x'_2, \ldots, x'_n$, look up in $S$ sequentially or in parallel to determine an $x^k$ ($k \in \{1,2,\ldots,m\}$) to which the approximation-degree of $x'$ is highest;
• If such an $x^k$ is found, return $f(x') = f(x^k)$;
• Else exit.
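A possible reading of this classification procedure is sketched below. Several details are assumptions made for illustration: every training point gets a box-shaped strict approximation region of the same half-widths, the degree in each coordinate is the symmetric special case of Equation (2.1), and when a query falls into several boxes the one with the largest minimum coordinate degree wins (the paper does not define how coordinate degrees are combined). The name `adb_classify` is hypothetical.

```python
def adb_classify(examples, half_widths, query):
    """Return the class label of query, or None if it stays unclassified.

    examples: list of (point, label); half_widths: half side lengths of
    the box around each training point, one per coordinate.
    """
    best_label, best_degree = None, -1.0
    for point, label in examples:
        # Approximation-degree per coordinate; negative means the
        # coordinate lies outside the approximation region.
        degrees = [1.0 - abs(q - c) / h
                   for q, c, h in zip(query, point, half_widths)]
        weakest = min(degrees)
        if weakest < 0.0:
            continue   # not strictly approximate to this training point
        if weakest > best_degree:
            best_label, best_degree = label, weakest
    return best_label


training = [((0.0, 0.0), "A"), ((5.0, 5.0), "B")]
print(adb_classify(training, (1.0, 1.0), (0.5, -0.5)))   # inside the box of A
print(adb_classify(training, (1.0, 1.0), (0.5, 3.0)))    # close to A in x only
```

The second query is close to the first training point in one coordinate only, so it is left unclassified rather than assigned to that point's class; this is the behavior that strict approximation is meant to provide.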


Example 5.3. Suppose that the data points of the training examples of a classification problem are shown as the black circles in Fig. 5.3, that the small boxes surrounding them are their respective strict approximation regions, and that the white circles represent the data points to be classified. The results of classifying these data points with the learning algorithm using ADB interpolation are shown in the figure. As can be seen, two queried data points are classified into the two classes to which the corresponding training data points belong, because they are located in the strict approximation regions of those data points, while the two queried data points outside the small boxes are not classified.


Fig. 5.3. An illustration of applying the learning using ADB interpolation to classification 


It can be seen that for classification problems, learning using ADB interpolation has the following advantages:

• Although the algorithm is similar to the traditional 1-NN algorithm, its approximation regions are set around the data points of the training examples, and they are square strict approximation regions.

• Since the approximation and approximation regions involved in the algorithm are strict approximation and strict approximation regions, learning using ADB interpolation avoids, at the source, the problem in traditional instance-based learning (e.g., the k-NN algorithm) that data objects with only partially similar attributes are classified into the same class.

• The classification result of learning using ADB interpolation is unique; unlike with the k-NN algorithm, the result does not change with the value of k.
6 Summary


In this paper, starting from the problem of finding the approximate value of a function, we introduced a measure of the approximation-degree between numerical values, proposed the concepts of “strict approximation” and “strict approximation region”, derived the corresponding one-dimensional interpolation methods and formulas, and then presented a calculation model called the “sum-times-difference formula” for high-dimensional interpolation, thereby developing a new interpolation approach: approximation-degree-based interpolation, i.e., ADB interpolation. ADB interpolation was applied to the interpolation of actual functions with satisfactory results. Judging from the principles and examples, the approach is novel and has the advantages of simple calculation (all arithmetic) and stable accuracy (benefitting from local interpolation and from the approximation-degrees always being not less than 0.5); in particular, the approach facilitates parallel processing, is well suited to high-dimensional interpolation, and is easy to extend to the interpolation of vector-valued functions.

Applying ADB interpolation to instance-based learning, we obtained a new instance-based learning method, learning using ADB interpolation, and gave several examples of the learning. Judging from the principles and examples, the learning method uses a distinctive technique (e.g., taking the data points of training examples as centers to set approximation regions, which are also strict approximation regions, and to compute approximation-degrees). Besides the advantages of ADB interpolation, the learning method also has the advantages of a definite mathematical basis, implicit distance weights, avoidance of misclassification (guaranteed by strict approximation), high efficiency (benefitting from distributed storage and parallel processing), a wide range of applications (it can be applied to regression or classification problems, and can be used for large-sample learning as well as small-sample and even single-sample learning), and interpretability. In principle, this method is a kind of learning by analogy. It and deep learning, which belongs to inductive learning, can complement each other, and for some problems the two can even achieve “different approaches but equal results” in a big data and cloud computing environment. Thus, learning using ADB interpolation can also be regarded as a kind of “wide learning” that is dual to deep learning.